# Conversation Optimization
Google Gemma 3 12b It Qat GGUF
Gemma-3-12b model based on Google QAT (Quantization-Aware Training) weight quantization, offering multiple quantized versions to accommodate different hardware requirements.
Large Language Model
G
bartowski
10.78k
16
Orpheus 3b FT Q4 K M.gguf
Apache-2.0
Orpheus is a high-performance text-to-speech model, fine-tuned to achieve natural and emotionally rich speech synthesis. This repository hosts the 8-bit quantized version of the 3-billion-parameter model, optimizing operational efficiency while maintaining high-quality output.
Speech Synthesis Supports Multiple Languages
O
lex-au
736
2
Mlabonne Gemma 3 27b It Abliterated GGUF
A quantized version based on Google Gemma 3B model, optimized using llama.cpp, supporting multiple quantization levels, suitable for text generation tasks.
Large Language Model
M
bartowski
7,217
20
Llama 2 7b Chat Hf Q4 K M GGUF
GGUF quantized version of Meta's Llama 2 series 7B parameter chat model, suitable for local deployment and inference
Large Language Model English
L
matrixportal
220
4
Llama
Llama 2 is a 70-billion-parameter conversation-optimized large language model developed by Meta, surpassing most open-source dialogue models in benchmark tests with safety comparable to mainstream proprietary models
Large Language Model
Transformers English

L
TheCraftySlayer
48
1
Featured Recommended AI Models